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Of '.he approximately 40 languages in Uganda, some are very similar to one 
another and may be to some extent mutually intelligible. Because no one knows how to 
measure degrees of mutual intelligibility, the authors are attempting to establish 
reliable techniques which would be not only of practical value for the study of 
language problems in Uganda, but also theoretically interestma for sociolinguistics 
Discussed in this paper are possible techniques tor testing mutual 
intelligibility — asking a number of informants to assess the extent to which they can 
understand and follow instructions in the other languages, assessing l;nguistic 
similarities. The authors also propose using a system of comparison of lexical items 
which include common cultural phenomena (avoiding words which may produce a 
number of equivalent forms). A new approach to the finding of cognate words in 
related languages "=^ntails eliciting two kinds of word lists from each informant (1) 
The informant is asKed to give common equivalents for each word. (2) The informant, 
given the word in the neighboring languages, is asked if there is a similar sounding 
word with the same or a related meaning in his own language. Also described is a 
proposal for computing the degree of similarity between each pair of words, the 
distinctive features will be encoded in binary terms. The authors hope that this data 
will be processed at the Kampala Computer Centre. (AMM) 
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The Measurement of cross-language Communication 



There are probably about kO languages in Uganda. Some of them are 
very similar to one another, and may be to some extent mutually intelli- 
gible, But nobody knov/s how to measure degrees of mutual intelligi- 
bility. If we could establish a reliable technique it would be not only 
of practical value for the study of language problems in Uganda but also 
theoretically interesting for sociolinguistics. 

There are basically three ways in which the mutual intelligibility 
of different languages can be assessed. Firstly, informants can be 
asked to estimate the extent to which they can understand neighbouring 
languages. Secondly, one can test the extent to which speakers of one 
language can follow instructions given to them in another. Thirdly, one 
can assess the degree of linguistic similarity (in syntax, lexicon and 
phonology) among the languages in question. We propose using various 
forms of all three of these basic techniques, and examining the corre- 
lations between them. There is no convincing evidence that any of them 
is a valid measure of the difficulty of cross-language communication. 

As we shall see, there are both practical and theoretical problems in 
connection with each of them, and it is not surprising that socio- 
linguists have been reluctant to accept any of them. But since each of 
them is independently motivated (albeit weakly), if we could demonstrate 
a good correlation between any two of them, it would be far more likely 
that they are in fact valid. When such diverse techniques produce 
similar results there are strong grounds for accepting them. 

There are several difficulties in asking informants to assess the 
extent to which they can understand neighbouring languages. In the 
first place one is interested not in the ability of a single informant, 
but in the average ability of all the members of the speech community 
to which he belongs. The best way that I can think of for getting this 
information is to ask a number of informants, each of whom represents 
(in some sense) the local speech community. Any guidance on techniques 
of selecting such informants would be much appreciated. My own pre- 
ference would be for choosing informants who represented the community 
in knowing their views, rather than in being themselves typical. It is 
obvious that the questions should not be of the form ”How much can 



understand of a speech in such and such language,” If skilfully put to 
a knowledgeable informant such questions should produce data about the 
community rather than the individual. Hopefully, the answer should be 
placeable on the scale ”None ; a little; the main points; most; all”. 
If a series of such questions were asked about different hypothetical 
situations, such as ”If someone preached a sermon,,..,; If someone 
told a story..,,,; If someone from such and such a place started 



” but more of the form ”How much an people around here 
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working in this village...." and if a number of different people were 
asked, it should be possible to get a reliable estimate of the beli'^‘fs 
of a speech community concerning the degree of communication possible 
in other languages. These beliefs might, of course, be biassed. It 
has often been shown (e.g. by Wolff 1959, Hymes I 968 ) that the attitude 
of one group of j’oople to another has an effect at least on their belief 
about the mutual intelligibility of their languages. Consequently, the 
informants should be asked additional questions which probe their atti- 
tude to other groups. It may turn out that the results obtained by this 
technique can be correlated with those obtained by other techniques 
only when some weighting factor corresponding to the attitudinal 
differences is taken into account# 

From the theoretical point of view, operational tests of cross- 
language communication seem open to fewer objections; but there are 
considerable practicable difficulties in using them on a country-wide 
scale. The work of the Summer Institute of Linguistics in Mexico is 
interesting in this respect. As far as I can recall (from conversations 
with John Crawford and Peter Landerman) their project involves going 
from village to village both making recordings, and also, in each 
village, playing recordings made in all the surrounding villages in 
order to test the extent to which they could be understood. The 
material which they record usually consists of a short account of some 
normal daily activity. They avoid using folk tales or any convention- 
alised material; and they also prefer not to use translations (which 
may be unreliable) of any of the standard English comprehension tests. 
This policy is no doubt the most appropriate for them; but it leads to 
difficulties in maxing up questions to test the degree of comprehension 
of each passage. In our circumstances, where we have skilled informants, 
I would preler to use both passages and questions which have been care- 
fully matched in their grammar and vocabulary, so that they might be 
presumed to be equal in their degree of difficulty of comprehension. 

But we ar.. still left with the problem of how to select a group who 
would be capable of listening to a series of recorded tests, and somehow 
noting their answers. Classes in the primary schools, or groups of 

church members form a biassed sample of the local population. But they 
are probably the best we can get. 

The third technique, the comparison of linguistic structures, has 
been widely used in historical linguistics and in language teaching 
methodology (where it is known as contrastive analysis). Since we are 
concerned with estimating cross-language communication, our approach 
will be slightly different. Ideally, we want to consider the phonology, 
the syntax, and the lexicon of each of the languages being compared. 

But nobody knows the relative contribution of each of these factors to 
mutual intelljgibility; and in practice it is hard to quantify differ- 
ences in syntactic and phonological patterns. Accordingly we propose 
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using mainly a system of comparison of lexical items, albeit in a 

uew form# Tbe first difference in our methodology lies in 
our choice of vocabulary items to be compared. A common technique 
(Swade i 1950, 1955) is to use a 100 or 200 basic word list, carefully 
selected ^o as to be independent of any particular culture (of 
Gudschinsky 195^). But this is an unnecessary constraint when one is 
studying the cross-language communication among a limited group of 
languages whose speakers share many cultural phenomena. Thus elephants 
are common throughout the country, and are a proper subject of enquiry 
(but one would ’^ot, of course, ask questions about tigers or ice). The 
word list still has to be balanced so that it does nuo refle ^ one 
aspect of the culture unduly. It would be unwioe for it to consist 
entirely of a list of animal names j a language might have borrowed 
these items and no others from its neighbours. But since, for assess- 
ing mutual intelligibility, we are not concerned with whether words 
have been borrowed from another language or not, we do not have to 
limit ourselves to the so-called basic vocabulary which is theoretically 
resistant to change through contact. 

Because of the additional flexibility which is open to us for the 
choice of words to be compared, we can also avoid using words which may 
produce a number of equivalent forms. Thus for an English word like 
’’come’* there may be two or three words each of which might be regarded 
by different investigators or different informants as the correct 
common equivalent in a given language. But, to stick to our previous 
example, an elephant is an elephant, and there is unlikely to be a 
Ugandan language which has more than one generic term for ’’elephant”, 
(They may, of course, have additional specific terms for ’’rogue 
elephant” etc.; but informants normally recognise these as specifics, 
and can easily be induced to give the general one.) 

Vi/o are also going to try a new approach to the finding of cognate 
words in related languages. Nobody knov;s the part played in cross- 
language communication by the existence of cognate words with slightly 
different meanings (such as ’’mutton” in English and ’’mouton” in French); 
without further research I would not like to say whether they help or 
confuse, iVe propose getting two kinds of word lists from each infor- 
mant. First we will elicit the common equivalents for each word. Then 
we will tell the informant the word in the neighbouring languages, and 
ask him if he has a similar sounding word with the same or a related 
meaning in his language. Again, we are trading on the fact that we 
are dealing with sophisticated informants, who are interested in lan- 
guage problems, and who can, with a little training, use their com- 
petence as native speakers to great adv^tage. Computing the degree 
of similarity between two languages using first the common equivalents 
and then the deliberately sought cognates should give us two different 
indices, either one of which might correlate with the other possible 
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indices of cross-«language communication* 

We also propose trying a new way of computing the degree of 
smililarity betv/een each pair of words* Since we are not concerned with 
measuring genetic similarity between languages, we need not stick to 
the criteria which are usual in lexicostatistics or in traditional 
historical studies, sucxi as the necessity of regular, explainable, 
sound changes* All we are worried about is the degree of similarity in 
the sound of the words in the particular pair in question* On the basis 
of previous studies (reference to Wickelgreen (sp?)etc*) it would appear 
that the degree of similarity between two sounds depends on the number 
of phonetic features which they have in common* Accordingly we would 
like to develop a way of comparing each sound in each word in each lan- 
guage with the corresponding sound in each corresponding word in all the 
other languages* V/o are a long way from getting all the bugs out of the 
procedure at the moment. But it might work as follows. 

Consider the data in Table 1* 

TABLE 1 * Unchecked data in orthography on 9 interlacustrine Bantu lan- 
guages 



English 


fire 


water 


sun 


moon 


two 


three 


four 




Luganda 


omuliro 


aniaazi 


enjuba 


omwezi 


biri 


ssatu 


nnya 




Lusoga 


omuliro 


arnadhi 


end’.iuba 


omwezi 


ibiri 


isatu 


ina 


If 


Lugisu 


kumuliro 


kamezi 


inyanga 


kumivezi 


zibiri 


zidatu 


zine 




Lunyole 


omuliro 


amaji 


elyuba 


omwesi 


ebiri 


edatu 


ene 


Lugwere 


omusyo 


amaj.zi 


isana 


omwezi 


ibiri 


isatu 


inna 


Lusamia- 


















Lugwe 


omuliro 


amachi 


eryuba 


omwoozi 


chibiri 


chidatu 


chine 




Rukiga 


omuriro 


amaizi 


eizoba 


okwezi 


ibiri 


ishatu 


ina 


1 


Runyar- 
















wanda 


umuriro 


amazi 


izuba 


ukwezi 


ebyiri 


eshatu 


enye 


Runyoro 


omurro 


amaizi 


izooba 


okwezi 


ibiri 


isatu 


ina 



These are obviously closely related languages* The major problem 
is to ensure that when we quantify the degree of similarity in sound, we 
do so by comparing each segment with the appropriate segment in the 
corresponding word in the other languages. There are two steps to the 
solution we have chosen, I'irstly, each word will be normalised so that 
the first consonant of the stem begins in a given location. Secondly, 
each word will be transcribed so that it consists of an alternating 
sequence of vov/els and consonants. Double vowels and double consonants 
marked by a superimposed length mark; and the consonant com- 
pounds will be treated as single units with features of prenasility, or 

ion , or labiovelarisation. Both these steps involve untested, 
but seemingly reasonable, assumptions about the subjective degree of 
phonetic similarity between cognate words in closely related languages. 

It also seems reasonable to assume that v/hen the same word in^ two 
different languages has a similar form but a different number of 
segments in each language, then the extra segments should be considered 
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as contributing only slightly to the perceptual differences. This will 
be achieved by adding dummy segments (unmarked segments, in the 
Chomsky -Halle sense) so that all words in a set have the same length. 

We might also consider the possibility of calculating differences in 
stems or roots separately from those in affixes. It is at least 
possible that speakers of related Bantu languages will be less put ouo 
by differences in prefixes than by differences in stems. 

Each segment in each word will be fully specified in terms of a 
set of distinctive features. The distinctive features chosen will not 
be those of Jakobson & Halle (1955) or Chomsky & Halle (I 968 ) or 
Ladefoged (I 967 ), but will be a set appropriate for the population to 
be examined, namely, Ugandan languages. All the features will be 
binary. This is not to be taken as a reversal of any previous 
position stated in Ladefoged (19^7 and elsewhere) » but is partly for 
ease in computer processing, and partly because, at this stage of our 
ignorance, I would rather encode ternary and quaternary features in 
binary terms so as to avoid prejudging the issue of whether or not 
they are linearly ordered sets. The features are also set up so that 
certain redundancies are built in. This will result in some segments 
which might have been considered to differ in only one feature (in, 
say, the Jakobson-Halle system), nevertheless being counted as 
different in two. The redundancies have been used so that segments 
are characterised in a way which seems intuitively (an on the basis 
of the experimental evidence for English) to correspond to the degree 
of phonetic difference between them'. The possible segments for 
Luganda are shown in Table 2 on page 6. The difference between each 
of them is shown in the matrix in Table 3 on page 6, 

It is hoped that all the data will be processed at the Kampala 
Computer Centre. Table 4 (page 6) exemplifies the inputs corres- 
ponding to two of the words in Table 1. It may be seen that these 
two words differ by a total of 7 points. If we added the total 
differences in 100 words, we should have a fairly sensitive measure 
of the degree of difference between two languages. I’or each pair of 
word lists we should get at least two counts, one in which only the 
stems were compared (say columns 5~10, if the first consonant of the 
stem had always been placed in column 5)» and one in which whole 
words were compared (say columns 1—10), This assumes that no word 
has a prefix longer than 4 segments, or a stem longer than 6. 

We hope to gather matched data in the form of 100-word lists from 
each of about 40 languages. These languages will be split into k 
groups, so that vn compare only languages which belong to the same 
family. It would be virtually impossible to use this teclmique for 
languages which differ greatly. But this is not a source of concern, 
since we would not expect to find any cross-language communication 
between these languages. If this technique for assessing mutual 
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intelligibility can be shown to b© valid (by correlating it with the 
other techniques), it will be extremely convenient. It is obviously 
far more reliable than any technique which depends for its success on 
the careful selection of informants. To compile a good 100-word list 
one needs a single intelligent informant who knows the language well; 
one docs not need to find a group of informants whose I.Q.’s and 
linguistic backgrounds are a representative sample of their speech 
community. 



TABLE 2 ! Distinctive features of some of possible consonantal 

segments in Bantu languages. (To be extended later.) 
nny is the orthographic representation of a long palatal 
nasal. These features have been chosen and the values 
for segments assigned as indicated not for the purpose 
of setting up appropriate natural classes for use in 
phonological descriptions, but only so as to provide a 
measure of the degree of a phonetic similarity between 
segments (as shown in Table ^,) 0 = +j l =; — 



stop 

nasal 

fricative 

approximant 

voiced 

frontal 

coronal 

long 

labialised 

palatalised 



p . b m 

0 o 1 

1 1 o 

111 
111 
loo 
o o o 

111 
111 
111 
111 



PP mp py 

0 o o 

1 o 1 

111 
3. 1 1 

111 

0 o o 

111 
oil 
111 

1 1 o 



pw f t c 

0 loo 

1 111 
1 oil 
1 111 
1 111 

0 o o 1 

1 loo 
1 111 

0 111 

1 111 



k nny 

0 1 

1 o 
1 1 
1 1 
1 o 
1 1 
1 o 
1 o 
1 1 
1 1 



Number of differences in the feature specifications of 
some pairs of consonants in Bantu languages 



P 

b 

m 

PP 

mp 

py 

pw 

f 

t 

c 

k 

nny 



p b m PP mp 

0 13 1 1 

0 2 2 2 

0 4 2 

0 2 

0 



py pw f t c k nny 

1 1 2 12 1 6 

2 2 3 2 3 2 3 

4 4 3 4 3 4 3 

2 2 3 2 3 2 3 

2 2 3232 3 

0 2 3232 6 

0 3232 6 

0343 e 

0123 

0 14 

0 3 

0 



TABLE 3 ; 
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TABLE_Jf: Matrices for the comparison of the Lusoga and the 

Lugisu words for '’three" (unchecked data) 



Lugisu Lusoga 


z 


• 

1 


d 


a 


t 


U i 


I mt 

} 


i 


s 


a 


t 


u 


Segment number 


1 


2 


3 


4 


5 


6 


1 

i 1 


2 


3 


4 


3 


6 


odd 

numbers 


even 

numbers 


























stop 


high 


1 


o 


o 


1 


0 


o 


■ 1 


o 


1 


1 


0 


0 


nasal 


mid 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


fricative 


low 


o 


1 


1 


o 


1 




1 


1 


o 


o 


1 


1 


approximant 


front 


1 


o 


1 


1 


1 


1 


‘ 1 


o 


1 


1 


1 


1 


voiced 


central 


o 


1 


o 


o 


3 


1 


1 


1 


1 


o 


1 


1 


frontal 


back 


o 


1 


o 


1 


o 


o 


1 


1 


o 


1 


o 


o 


coronal 


long 


o 


1 


o 


1 


o 


1 


. 1 


1 


o 


1 


o 


1 


long 


y-glido 


1 


1 


1 




1 


1 


1 


1 


1 


1 


1 


1 


labialised 


high tone 


1 


1 


1 


1 


1 


1 


M 




1 


1 


1 


1 


palatalised 


lov/ tone 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


TABLE 5: 


Distinctive features of some of the 
segments in Bantu languages (Tonal 
added to this set when appropriate) 


possible vocalic 
features would bo 




i e 


a 


o 




u 


aa 


ay 












high 


o 1 


1 


1 




o 


1 




1 












mid 


1 o 


1 


o 




1 


1 




1 












low 


1 1 


o 


1 




1 


o 




0 












front 


o o 


1 


1 




1 


1 




1 












central 


1 1 


o 


1 




1 


o 




o 












back 


1 1 


3. 


o 




o 


1 




1 












long 


1 1 


1 


1 




1 


o 




1 












y-glide 


1 1 


1 


1 




1 


1 




o 












TABLE 6: 


Number of differences 
some pairs of vowels ; 


in the 
in Bantu 


feature specifications of 
languages 




i e 


a 


o 




u 


aa 


ay 












i 


0 2 


h 


4 




2 


3 




3 












o 


0 




2 




4 


3 




3 












a 




0 


4 




4 


1 




1 












D 






0 




2 


3 




5 












u 










0 


3 




3 












aa 












0 




2 












ay 
















0 















